CDS

Accession Number TCMCG024C03338
gbkey CDS
Protein Id XP_021972940.1
Location complement(join(72933287..72933595,72934227..72934464,72934558..72934697,72934784..72934832,72937441..72937769,72938823..72938909,72938999..72939162,72939258..72939414,72940256..72940375,72940873..72940983,72941282..72941458,72942234..72942379,72943365..72943548,72943625..72943762,72943878..72944034,72944118..72944308,72944402..72944644,72944764..72945054))
Gene LOC110868156
GeneID 110868156
Organism Helianthus annuus

Protein

Length 1076aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022117248.2
Definition uncharacterized protein LOC110868156 [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category U
Description PGAP1-like protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03012        [VIEW IN KEGG]
KEGG_ko ko:K03263        [VIEW IN KEGG]
ko:K05294        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00563        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00563        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGATAGGCTTCAAGGCTAAATTTCGATTAGCAACGATTGTTGTTCTTTCAATCGGGATCGCTCTTGTTGCTATATATGATTTGTTAAAGCCAATTTCAAATGGTTGTACCATGACATACATGTACCCAACCTACATTCCGATATCAGCACCTAAAAGTTTATCTTCTTCAAAGTATGGACTGTATTTGTATCATGAAGGGTGGAAACAAATTGATTTTGATGAACATCTTAAACAACTTAACGGGGTTCCTGTTCTTTTCATTCCTGGCAATGGAGGTAGCTACAAGCAGGTCAGGTCCTTAGCGGCAGAATCTGACAGAGCGTATCAAGGGGGCCCACCTGAACCTATGTTGTACCAAGAGGCTTCCTTAATGTTTGAGGGATTAGAGATAGATGAAACGAATATTCCTATACTCAACCAATATGCACGAAGGCTTGACTGGTTTGCAGTTGACCTTGAAGGTGAACATTCTGCAATGGATGGTCAAGTACTCGAAGAACACACAGAATATGTAGTACATGCCATTAACAGGATTCTGGATCAATATAAAGAATCTCAAGATGCTCGAGTAAAAGAAGGTGCTGTTGCATCTGGTAGCCTGCCGAATAGTGTCATATTGGTTGGCCATTCTATGGGCGGTTTTGTTGCTAGAGCTGCTGTTGTGCATCCTAATTTGAGAAAATCAGCTGTTGAAACCATTCTTACACTATCAGCTCCACATCAGTCACCTCCGTTGGCACTGCAACCGTCATTAGGTCACTATTTTGAATACATAAATCAGGAATGGAGAAAGGGATATGAGGTTCAAAACTCTAGAACAGGAGCTCAACTATCTAGAGTGATTGTTATCTCCATTTCTGGTGGTGGTAATGATTACCAGATAAGGTCAAGGTTGGAATCTCTTGATGGTATAGTCCCAACTACCCATGGTTTTATGATCAGTAGCATGGGGGTGAAGAATGTATGGCTATCAATGGAACACCAGGTTATCTTATGGTGTAATCAACTTGTTGTGCAAGTTTCACATACTCTTCTTAGTTTGGTAGACCCTGAAACGGGTCACCCGACTTCTGGCCCAAGGAAAAGACTAGCAATATTTACAAAAATGCTTCAAAGTGGAATGCCAGGAAGTTTGTCAGGGCGATCGGATTTTCATCAGCAATCGCCACGTCTTCCTTTACTGAAAGGGAGAAACTTTTTTGGATCTGTGAGAAAAAATATTACCGCATGTCCCAGTAAAATTCGTTGGAGCGATGAGGGACTTGAAAGGGATCTTTACATCAAGACACCAACAGTAACTATTTTAGCAATGGATGGCAGAAGACGTTGGTTGGACATAAAAGAACTGGGGTCAGATGGAAGAACCAACTTTGTTCTTGTGACAAATCTTCTTCCCTGCTATGGAGTCAGACTTCATCTTTGGCCCGAAAAAGGAACTTCTATGTCAAATTCGCCTCTTAGCAAAAGGGTTGTAGAAGTGACATCAAGAATTGTTCAGGTTCCATCAGGACCAGCACCAAGGCAGATTGAACCAGGAAGTCAGACGGAACAACCACCACCATCAGCTGTATTTTGGTTGGACCCTAAGGATATGCATGGTTTCAGATTCCTTACAATCTCAGTTGCACCAAGCCCAACTGTTTCAGGGAGACCTCCACCCGCAGCTTCAATGGCAGTTGGGCAGTTTTTCAACCCAGAGGAAGGCCGTAAAGAATTTTCTCCTAAATCGTTGCTTCTTTCTGTGTTTTCTCAGAAGGACATTTTCATTAAGGAGGATCATCCTATTGTCATGAATATTACATTCAGTATCAACTTAGGGCTTTTGCCAGCTACAGTTTCTCTAGAAACTACAGGTTGTGGAATAAAAAAGTCTGGACTTCCTGTTGAAGAAGCTGGAGACATGAATAGTGGCAGACTTTGCAAGCTAAGATGCTTTCCACCTGTCGCCCTTACGTGGGACCCTGCATCTGGTCTTAACATATTTCCTAATTTAAATTCTCGTACAATAGAAGTTGACTCATCACCTGCACTTTGGAGCTCAACTCAGGGATCTGAGCAGAGCAATGTTTTGTTACTGGTTGACCCACATTGTTCGTATAAAACTAGTGCTGCTGTTTCTCTAACTGCTTCTGCCCGGAGGTTTATGCTTCTATATAACTCCCAGATCGTTGGTTTCTCGTTTTCTGTAGTATTCTTTGCCCTAATGAGGCAAGCTAATGCATGGGAGCTTGGTTTTCCTGTTCCTTCGTTGCTGTCTGCAGTAGAATCAAATCTGAGAATGCCACTGCCGTTTCTTTCACTTGCTATTTCACCCGTTCTGATTGCTTTGTTTTATTCCTATCTAAGTTCAAAGTCGTTACCTTCAGTTGGTAGTTTCTTTGTTGTCTCAATGATTTGCTATCTAATAGCAAACGGGACCGTAATAGTTTTAGTATTAACCACACTAATCCTCTCCCATTTGGTTGCCAGAATACACGTCTTCTTCAAGACAAGGTGGAGATGGTGGATACTTGATCTCTCCACTAGTTTTTTCTCATTTAAGGTCACTAGGGTCATAAACGCCAATCCGTCATTGGCCACATCACTTCTTGCTATCGTTTTAGTCTATTTTGTCCATCCAGCTTTAGGTCTCCTTATTCTGGTGTTCTCACATGTGCTATGCTGTCACCATGCATTGTGCAGCTTTCTTACAGCGTCCAGTGAGGCTCGGGCCGAAGATTTATTCGGGTTTGGAAGTGGAATCAAGAAAATTGAATCGGAGGTGGGATTACCTGTGGATGACCACAGTTCCAGCTCCCCGGACTCTACGAGAAGCTATGGTGACACACAACTAGAGATGTTCCACCACCGCCATGGCTTGCTAATTCTCGATCTTCTTTGTCTGCTCATGTTTCTTCCTTCACTCGTTGCTTGGTTGGAGAGGCTAAGCATGAGCCACAACTTCCCATGGCTCTTGGATTCCATGCTCTGCATGGGTATCATTCTGCACGGTATCTGCAACTCGAAACCCGAGTTTAACGTCTTCTTTCAGATACCCGGAACGAGAGGGTATGAAATTAGACAAGGGTTTGTGTACATGATTGCTGGGTATTGTTGTTATCTTTGGGGTTTAGATTTAGCTCCTTACAAAGCTTTCTATGCTATAGCTGCCATAGGAGTGGTATCATTTATTTTCAGAATTATGGAAAGAAGAAACCGAAACAGTAGAAAGCATTCCCATCGACACTGA
Protein:  
MIGFKAKFRLATIVVLSIGIALVAIYDLLKPISNGCTMTYMYPTYIPISAPKSLSSSKYGLYLYHEGWKQIDFDEHLKQLNGVPVLFIPGNGGSYKQVRSLAAESDRAYQGGPPEPMLYQEASLMFEGLEIDETNIPILNQYARRLDWFAVDLEGEHSAMDGQVLEEHTEYVVHAINRILDQYKESQDARVKEGAVASGSLPNSVILVGHSMGGFVARAAVVHPNLRKSAVETILTLSAPHQSPPLALQPSLGHYFEYINQEWRKGYEVQNSRTGAQLSRVIVISISGGGNDYQIRSRLESLDGIVPTTHGFMISSMGVKNVWLSMEHQVILWCNQLVVQVSHTLLSLVDPETGHPTSGPRKRLAIFTKMLQSGMPGSLSGRSDFHQQSPRLPLLKGRNFFGSVRKNITACPSKIRWSDEGLERDLYIKTPTVTILAMDGRRRWLDIKELGSDGRTNFVLVTNLLPCYGVRLHLWPEKGTSMSNSPLSKRVVEVTSRIVQVPSGPAPRQIEPGSQTEQPPPSAVFWLDPKDMHGFRFLTISVAPSPTVSGRPPPAASMAVGQFFNPEEGRKEFSPKSLLLSVFSQKDIFIKEDHPIVMNITFSINLGLLPATVSLETTGCGIKKSGLPVEEAGDMNSGRLCKLRCFPPVALTWDPASGLNIFPNLNSRTIEVDSSPALWSSTQGSEQSNVLLLVDPHCSYKTSAAVSLTASARRFMLLYNSQIVGFSFSVVFFALMRQANAWELGFPVPSLLSAVESNLRMPLPFLSLAISPVLIALFYSYLSSKSLPSVGSFFVVSMICYLIANGTVIVLVLTTLILSHLVARIHVFFKTRWRWWILDLSTSFFSFKVTRVINANPSLATSLLAIVLVYFVHPALGLLILVFSHVLCCHHALCSFLTASSEARAEDLFGFGSGIKKIESEVGLPVDDHSSSSPDSTRSYGDTQLEMFHHRHGLLILDLLCLLMFLPSLVAWLERLSMSHNFPWLLDSMLCMGIILHGICNSKPEFNVFFQIPGTRGYEIRQGFVYMIAGYCCYLWGLDLAPYKAFYAIAAIGVVSFIFRIMERRNRNSRKHSHRH